Utility Pattern Approach for Mining High Utility Log Items from Web Log Data
نویسندگان
چکیده
. Mining frequent log items is an active area in data mining that aims at searching interesting relationships between items in databases. It can be used to address a wide variety of problems such as discovering association rules, sequential patterns, correlations and much more. Weblog that analyzes a Web site's access log and reports the number of visitors, views, hits, most frequently visited pages, and so forth. Mining frequent log items from web log data can help to optimize the structure of a web site and improve the performance of web servers. Existing methods often generate a huge set of potential high utility log items and their mining performance is degraded consequently. Two novel algorithms as well as a compact data structure for efficiently discovering high utility log items are proposed. High utility log items are maintained in a tree-based data structure called utility pattern tree. Implementing mining process is done through Discarding Local Unpromising Items and Decreasing Local Node Utility strategies. Experimental results predict that these strategies can keep track of previously accessed pages of a user, identify needed links to improve the overall performance of a web page, and improve the actual design of web pages with only two database scans.
منابع مشابه
High Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences
Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...
متن کاملWeb Users Session Analysis Using DBSCAN and Two Phase Utility Mining Algorithms
One of the important issues in data mining is the interestingness problem. Typically, in a data mining process, the number of patterns discovered can easily exceed the capabilities of a human user to identify interesting results. To address this problem, utility measures have been used to reduce the patterns prior to presenting them to the user. A frequent itemset only reflects the statistical ...
متن کاملA New Algorithm for High Average-utility Itemset Mining
High utility itemset mining (HUIM) is a new emerging field in data mining which has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds minimum threshold. The basic HUIM problem does not consider length of itemsets in its utility measurement and utility values tend to become higher for itemsets containing more items...
متن کاملA New Algorithm for Web Log Mining
The enormous content of information on the World Wide Web makes it obvious candidate for data mining research. Data Mining Technique application is used to the World Wide Web referred as Web mining where this term has been used in three distinct ways; , Web Structure Mining, Web Content Mining and Web Usage Mining. Web Log Mining is one of the Web based application where it will facing with lar...
متن کاملAn Rule Based Mining Database with Similarityon Large Probabilistic Graph Matching
-Mining frequent itemsets is an active area in data mining that aims at searching interesting relationships between items in databases. It can be used to address to a wide variety of problems such as discovering association rules, sequential patterns, correlations and much more. Existing methods often generate a huge set of potential high utility item sets and their mining performance is degrad...
متن کامل